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DNA methylation is an important epigenetic modification that is 
essential for various developmental processes through regulating gene 
expression, genomic imprinting, and epigenetic inheritance! ^. Mam- 
malian genomic DNA methylation is established during embryogen- 
esis by de novo DNA methyltransferases, DNMT3A and DNMT3B°%, 
and the methylation patterns vary with developmental stages and cell 
types". DNA methyltransferase 3-like protein (DNMT3L) is a cat- 
alytically inactive paralogue of DNMT3 enzymes, which stimulates 
the enzymatic activity of Dnmt3a". Recent studies have established 
aconnection between DNA methylation and histone modifications, 
and revealed a histone-guided mechanism for the establishment of 
DNA methylation". The ATRX-DNMT3-DNMT3L (ADD) domain 
of Dnmt3a recognizes unmethylated histone H3 (H3K4me0)'^". 
The histone H3 tail stimulates the enzymatic activity of Dnmt3a in 
vitro™”’, whereas the molecular mechanism remains elusive. Here 
we show that DNMT3A exists in an autoinhibitory form and that 
the histone H3 tail stimulates its activity ina DNMT3L-independent 
manner. We determine the crystal structures of DNMT3A-DNMT3L 
(autoinhibitory form) and DNMT3A-DNMT3L-H3 (active form) 
complexes at 3.82 and 2.90 À resolution, respectively. Structural and 
biochemical analyses indicate that the ADD domain of DNMT3A 
interacts with and inhibits enzymatic activity ofthe catalytic domain 
(CD) through blocking its DNA-binding affinity. Histone H3 (but not 
H3K4me3) disrupts ADD-CD interaction, induces a large movement 
ofthe ADD domain, and thus releases the autoinhibition of DNMT3A. 
The finding adds anotherlayer of regulation of DNA methylation to 
ensure that the enzyme is mainly activated at proper targeting loci 
when unmethylated H3K4 is present, and strongly supports a nega- 
tivecorrelation between H3K4me3 and DNA methylation across the 
mammalian genome?'^!??, Our study provides a new insight into an 
unexpected autoinhibition and histone H3-induced activation of the 
denovoDNA methyltransferase after its initial genomic positioning. 

To investigate how DNMT3A activity is regulated, we performed an 
in vitro DNA methylation assay using recombinant DNMT3A2 (residues 
224-912), a catalytically active variant of DNMT3A consisting of Pro- 
Trp-Trp-Pro (PWWP), ADD, and CD domains (Fig. 1a)*?. H3K4me0 
(but not H3K4me3) peptide significantly stimulated the enzymatic acti- 
vities of DNMT3A2 or DNMT3A2 in complex with the CD-like domain 
of DNMT3L (designated لل‎ (Fig. Ib and Extended Data Fig. 1a, b). 
A similar effect was observed for DNMT3A2 or DNMT3A2-CPNMPSE 
using recombinant poly-nucleosomes carrying unmethylated histone 
H3 or H3K4me3 mimic (H3Kc4me3) (Fig. 1b and Extended Data Fig. 1c). 
Thus, the enzymatic activity of DNMT3AQ2 is stimulated by histone H3 
(but not H3K4me3) tail either in the form of free peptide, or within nucle- 
osome, in a DNMT3L-independent manner. 


We next characterized the critical regions for histone H3-induced 
activation of DNMT3A. H3K4me0 peptide activated DNMT3A2 and 
ADD-CD (residues 476-912) proteins with comparable (approximately 
sixfold) activity enhancement, but did not stimulate activity ofthe CD 
domain (residues 627-912) in the absence or presence of cot 
(Extended Data Fig. 1d, e). ADD-CD protein with histone H3 peptide 
(residues 1-20) fused at the amino (N) terminus (H3-ADD-CD) showed 
significantly higher activity than ADD-CD, and could not be further 
activated by H3K4me0 peptide (Extended Data Fig. le). These results 
collectively indicate that the ADD domain (but not the PWWP domain) 
is required for histone H3-induced activation of DNMT3A. 

Notably, the CD domain or CD-CP"MP*t complex showed compa- 
rable activity to that of ADD-CD or ADD-CD-CP"M7*' activated by 
histone H3 peptide (Extended Data Fig. 1d, e), suggesting that the ADD 
domain inhibit the activity of the CD domain and histone H3 release 
this inhibition. In support of this hypothesis, addition of ADD-linker 
(residues 476-626, Fig. 1a), but not H3-ADD-linker fusion protein inhib- 
ited the activity of the CD domain (Fig. 1c). These results confirm the 
existence of autoinhibition of DNMT3A, in which ADD-linker directly 
inhibits the enzymatic activity of the CD domain, and that this inhibi- 
tion is released by histone H3 tail. 

We determined the crystal structure of ADD-CD of DNMT3A in com- 
plex with CPNMT3E ( ADD. CD- CPNMPTE, at 3.82 À resolution (Fig. 1d 
and Extended Data Table 1). Although only one ADD-CD-CPNMT3L 
complex was observed within an asymmetric unit of the crystals, gel- 
filtration analysis indicated a tetramer formation in solution. Two ADD- 
CD-CPNMT*" complexes form dimer of dimers via the CD-CD inter- 
action in a two-fold crystallographic symmetry (Extended Data Fig. 2). 
The complex structure adopts an X shape with two CD domains located 
in the centre and the ADD and CP™™P domains located in the four 
corners. In the ADD-CD-CPNMT structure, CD- CPDNMTSE adoptsa 
similar fold to that of mouse CDP?™32_CP"™°F structure (Extended 
Data Fig. 3a)”. 

The ADD and CD domains fold into two individual structural mod- 
ules connected by the linker (Fig. 1d). The linker forms a twisted helix 
followed by an extended loop, which packs against a hydrophobic surface 
of the CD domain (Extended Data Fig. 3b, c). A loop region (residues 
526-533) extends out of the ADD domain and inserts into a pocket 
in the CD domain (Fig. le and Extended Data Fig. 3d). This pocket is 
mainly formed by basic residues R790, R792, H789, and R831 of the 
CD domain. Three acidic residues (D529/D530/D531) and hydropho- 
bic residues (Y526/Y528/Y533) of the ADD domain are brought into 
close proximity to the pocket. All residues that are involved in the intra- 
molecular interaction are highly conserved in DNMT3A/3B, suggest- 
ing a conserved ADD-CD interface among DNMT3A/B sub-family 
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Figure 1 | Structure of the DNMT3A-DNMT3L complex in autoinhibitory 
form. a, Colour-coded domain architecture of human DNMT3A and 
DNMT3L. b, Relative activities of DNMT3A2 using peptides or reconstituted 
nucleosomes as substrates in the absence or presence of CPNMTSL ¢, Activities 
of the CD domain in the absence or presence of ADD-linker or H3-ADD- 
linker fusion proteins. d, Ribbon representations of the overall structure of the 
ADD-CD-CP*MT- complex. AdoHcy are shown in stick representation; 
zinc cations are shown as grey balls. The colour scheme is the same as in a and 
used in all structural figures. e, Close-up view of the ADD-CD interface. 
Critical residues for the interactions are shown in stick representation. Relative 
activities were calculated according to basal activity for each protein in the 
absence of H3 peptide. Error bars, s.d. for triplicate experiments. 


members, but not DNMT3L (Extended Data Fig. 4). Structural com- 
parison also indicated that ADD?N™T?” would have steric hindrance 
with CPNMT" if DNMT3L adopts a similar conformation to that of 
DNMT3A (Extended Data Fig. 5: 

The intramolecular interaction was verified in the in vitro glutathione 
S-transferase (GST) pull-down assays, in which ADD-linker bound to 
the CD domain (Fig. 2a and Extended Data Fig. 5a-e). Mutations D529A, 
D531A, D529A/D531A, and Y526A/Y528A of ADD-linker, decreased 
the binding affinity to the CD domain, supporting their important role 
in mediating the ADD-CD interaction. In contrast, mutations R556A, 
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M548W, D530A, and Y533A of ADD-linker showed little effect on ADD- 
CD interaction (Fig. 2a and Extended Data Fig. 5a, b). Mutation R790A/ 
R792A ofthe CD domain also impaired its binding affinity to the ADD- 
linker protein (Extended Data Fig. 5c). H3K4me0 (but not H3K4me3) 
peptide hampered ADD-CD interaction, suggesting that histone H3 
tail releases autoinhibition of DNMT3A through disrupting this intra- 
molecular interaction (Fig. 2b and Extended Data Fig. 5d). 

Wenext mapped the critical regions for the autoinhibition of DNMT3A 
in the presence of CPAM (Fig. 2c). Compared with ADD-CD (Del- 
N476), Del-N522 showed a slight increase in enzymatic activity, whereas 
Del-N550, Del-N587, and Del-N610 significantly increased their activ- 
ities (Fig. 2d). Although ADD and ADD-linker showed comparable bind- 
ing affinity to the CD domain (Extended Data Fig. 5e), only ADD-linker 
(ADD-626) maintained the inhibitory function, while all other truncated 
proteins markedly decreased their inhibition on the activity of the CD 
domain (Fig. 2e). Replacement of residues 621-632 by a 12-residue Gly 
and Ser (GS) linker in ADD-CD partly released autoinhibition and showed 
less than twofold activity enhancement upon histone H3 stimulation 
(Extended Data Fig. 5f). The results indicate that the ADD domain and 
linker are both important for the autoinhibition of DNMT3A. 

We next tested whether residues on the ADD-CD interface are impor- 
tant for the autoinhibition of DNMT3A. Compared with wild-type pro- 
tein, mutations D529A, D531A, and Y526A/Y528A of the ADD-linker 
could barely inhibit the activity of the CD domain (Fig. 2f and Ex- 
tended Data Table 2). Mutations D529A, D531A, and Y526A/Y528A of 
DNMT3A2 showed enzymatic activities that are comparable to or slightly 
lower than that of corresponding mutants activated by histone H3 pep- 
tide (Fig. 2g). A more obvious release of autoinhibition was observed in 
assays using recombinant poly-nucleosomes as substrate (Fig. 2h). As a 
negative control, mutation M548W of DNMT3AQ2 remained in an inhib- 
itory form and was not activated by histone H3 tail because the mutant 
could not bind to histone H3 (ref. 18). Taken together, the ADD domain 
and linker function together to inhibit the activity of the CD domain, 
and ADD-CD association is essential for ADD-mediated autoinhibition 
of DNMT3A. 

Hhal is a well-characterized bacterial DNA (cytosine-5) methyltrans- 
ferase?. Structural comparison of ADD-CD-C?\™™*" and Hhal-DNA 
(Protein Data Bank (PDB) accession number 1MHT) complexes indi- 
cates that their catalytic domains adopt similar fold, and suggests that 
loops L1 and L2 of DNMT3A are potential regions for DNA interac- 
tion (Fig. 3a-c and Extended Data Fig. 4). Consistently, mutating basic 
residues K831, R836/N838, K841, and K844 on loop L2 impaired the 
enzymatic activity of DNMT3A (Fig. 3c and Extended Data Fig. 6a)". 
In the ADD-CD-CPhMT*. structure, the ADD domain is located close 
to the loop L2 and may thus generate steric hindrance for DNA inter- 
action (Fig. 3c). 

In support ofthe analysis above, ADD-CD markedly decreased, but 
H3-ADD-CD showed comparable DNA-binding affinity to that ofthe 
CD domain (Fig. 3d). Histone H3 peptide partly restored DNA-binding 
affinity of ADD-CD or DNMT3A2, which may have resulted from 
relative weak interaction between histone H3 peptide and DNMT3A2 
(Fig. 3d, e and Extended Data Fig. 6b). Consistent with their effect on 
the release of autoinhibition, mutations D529A, D531A, and Y526A/ 
Y528A of DNMT3A2-C°™™* partly rescued the DNA interaction 
(Fig. 3e), and histone H3 (but not H3K4me3) peptide increased the com- 
plex formation of ADD-CD-DNA, but not CD-DNA (Fig. 3f). Similar 
results were observed in electrophoretic mobility-shift assays (Extended 
Data Fig. 6c). Taken together, the ADD domain inhibits the activity of 
the CD domain by decreasing its DNA-binding affinity. This inhibition 
is restored by fusion ofthe histone H3 tail at the N terminus of ADD-CD, 
and is partly restored by addition of histone H3 peptide or mutations 
of residues on the ADD-CD interface. 

We also determined the crystal structure of ADD-CD-C 
complex with histone H3 peptide (residues 1-12) at 2.90 À resolution 
(Extended Data Table 1). The complex structure adopts a butterfly shape, 
with the ADD and CON"?! domains resembling the wings (Fig. 4a). In 
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the complex structure, two ADD-CD-CPNMT3L_H3 complexes forma 
dimer of dimers through CD-CD interface in a pseudo-two-fold s e- 
try. CD-CDP*MP*t adopts a similar fold to that of ADD-CD-CP "MP 
(autoinhibitory form) and mouse CpPnmtsa_CPnmtsL Structures, whereas 
the position of the ADD domain is obviously different in the structures 
of DNMT3A in autoinhibitory and active forms (Fig. 4a, b and Extended 
Data Fig. 7a, b). 

In the ADD-CD-CP*MP- 83 structure, the ADD-CD interaction is 
mediated by a network of hydrogen bonds and hydrophobic interactions 
(Extended Data Fig. 7c, d). A similar domain organization was observed 
in the DNMT3L-H3 structure", suggesting a conserved ADD-CD 
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Figure 2 | ADD-CD interactions are important 
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for autoinhibition of DNMT3A. a, GST 

ADD-609 476 609 pull-down assays for the ADD-CD interactions. 

ADD-614 476 614 Recombinant CD (residues 627-912) proteins were 

ADD-619 476 619 incubated with wild-type or mutant GST-ADD- 

ADD-623 476 623 linker proteins immobilized on glutathione resin. 
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The bound proteins were analysed using SDS- 
polyacrylamide gel electrophoresis (SDS-PAGE) 
and Coomassie blue staining (Extended Data 

Fig. 5a). The assays were quantified by band 
densitometry. Error bars, s.d. for triplicate 
experiments. b, GST pull-down assays in the 
absence or presence of histone H3 peptide 
(H3K4me0 or H3K4me3) as indicated in Extended 
Data Fig. 5d. c, Schematic representation of 
DNMT3A proteins used for in vitro 
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ي ي کي‎ ES e? methyltransferase activity assays. d, Effect of 
بې دې بې يې‎ éi various N-terminal deletions of ADD-CD on its 
CD enzymatic activity. e, Activities of the CD domain 


measured in the presence of various carboxy 
(C)-terminal deletions of ADD-linker proteins. 
f, Activities of the CD domain measured in the 
presence of wild-type and mutant ADD-linker 
proteins. g, h, Activities of wild-type or 
mutant DNMT3A2 measured using naked 
DNA (g) or poly-nucleosomes (h) as substrate. 
Error bars, s.d. for triplicate experiments. 
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interface among DNMT3 family members when they adopt the active 
form (Extended Data Fig. 7e). The histone H3 peptide binds to the 
ADD domain by forming a three-stranded anti-parallel B-sheet with 
two B-strands of the ADD domain (Extended Data Fig. 7f). The side chain 
of H3K4 is stabilized by residues D529 and D531 of the ADD domain, 
and tri-methylation of H3K4 will disrupt the interactions. Mutations 
D529A, D531A, and M548W of ADD-CD abolished their binding affin- 
ity to histone H3 peptide (Extended Data Fig. 7g). 

Comparison of the structures of ADD-CD-C°™" in autoinhibitory 
and active forms indicates two separate surfaces on the CD domain for 
ADD-CD interaction, and suggests a conformational change of DNMT3A 


Figure 3 | The ADD domain 
inhibits DNA-binding affinity of 
the catalytic domain. 

a, Superimposition of ADD-CD- 
CPNMTSL and Hhal-DNA complex 
structures shown in ribbon 
representations with the CONMT" 
domain omitted for simplicity. The 
colour scheme for the comparison is 
indicated. b, c, HhaI-DNA (b) and 
ADD-CD (c) structures are shown 
as in a. d, e, Superimposed 
fluorescence polarization plots for 
DNA-binding affinities of 
truncations (d) or mutants (e) of 
DNMT3A in the absence or 
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Figure 4 | Mechanisms for autoinhibition and histone H3 tail-induced 
activation of DNMT3A. a, Ribbon representations of the overall structure of 
the ADD-CD-C?\MT"_H3 complex. Histone H3 peptides are coloured in 
yellow. b, ADD-CD-CP^MT*t structure in autoinhibitory form are shown 
for comparison. c, Ribbon representations of ADD-CD-CPNMT3L 
(autoinhibitory form) and ADD-CD-CP'MP?- n5 (active form) complex 
structures with the CD domains superimposed. d, Superimposition of 


induced by histone H3 (Fig. 4a-c and Extended Data Fig. 8a). DNMT3A 
may prefer an autoinhibitory form in the absence of histone H3. Upon 
histone H3 association, DNMT3A adopts an active form to open an 
otherwise locked DNA-binding region for establishment of DNA meth- 
ylation. The negative-stain electron microscopy density maps clearly 
showed a distinct shape for DNMT3A2 (residues 275-912)-CPNMTS 
complex in the absence or presence of histone H3 peptide, supporting 
a conformational change of DNMT3A2-CP^MT*' induced by histone 
H3: that is, transforming from an X shape (autoinhibitory form) to a 
butterfly shape (active form) (Extended Data Fig. 8b-d). One-dimensional 
‘°F NMR measurement” also indicated a significant change of chemical 
shift (representing conformational change) for residue F827 (within the 
loop L2) upon histone H3 peptide (but not H3K4me3) titration, whereas 
only a slight change was observed for residue F868 (as a negative control) 
(Extended Data Fig. 8e, f). Collectively, both electron microscopy and 
NMR measurements support the existence of conformational change 
of DNMT3A induced by histone H3 tail. 

Superimposition of H3-ADD (PDB accession number 3A1B) and 
۸112-۲۳ (autoinhibitory form) structures indicated that 
ADD domains in the two structures adopta similar fold, and that histone 
H3 peptide in the H3-ADD structure has no overlap with the ADD- 
CD interface (Fig. 4d). Our previous studies have shown that mutation 
D529A or D531A not only significantly decreased H3-ADD interaction 
(Extended Data Fig. 7g) but also markedly decreased ADD-CD inter- 
action (Fig. 2a), and released the autoinhibition (Fig. 2g, h). Moreover, 
H3K4me0 peptide could disrupt ADD-CD interaction (Fig. 2b). Thus, 
histone H3 releases the autoinhibition of DNMT3A through binding 
to residues ofthe ADD domain on the ADD-CD interface (such as D529 
and D531) and disrupting the intramolecular interaction. The two resi- 
dues function as a critical switch that can exist in both forms of DNMT3A 
and couple histone H3 recognition to the release of the autoinhibition. 
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ADD-CD-CPNMP (CPNMT3L omitted for simplicity) and H3-ADD 
structures. The ADD domain and histone H3 in the H3-ADD structure are 
coloured in grey and yellow, respectively. Residues for H3-ADD and ADD-CD 
interactions are shown in stick representation. e, A working model for the 
autoinhibition and histone H3 tail-induced activation of DNMT3A. DNMT3L 
and the PWWP domain of DNMT3A are not shown for simplicity. 


Here we propose a working model for histone H3-induced dynamic 
regulation of the de novo DNA methyltransferase (Fig. 4e and Supplemen- 
tary Video 1). DNMT3A exists in dynamic equilibrium between auto- 
inhibitory and active forms, and the ADD domain oscillates between 
the two conformations. In the absence of histone H3, DNMT3A pre- 
fers an autoinhibitory form, in which the ADD domain binds to the CD 
domain and hinders its DNA-binding affinity. Once DNMT3A (or the 
DNMT3A-DNMT3L complex) is recruited to the nucleosome, H3K4me0 
binds to the ADD domain and stimulates DNMT3A to undergo a signi- 
ficant conformational change from an autoinhibitory form to an active 
form. The ADD-CD interaction in the active form allows DNMT3A to 
adopt a relative stable conformation so that DNA methylation occurs 
in a range permitted by the histone H3 tail. Even when DNMT3A is 
recruited to an H3K4me3-containing nucleosome, the enzyme will remain 
in its autoinhibitory form to avoid DNA methylation within an imper- 
missible chromatin environment. Our work reinforces the connection 
between DNA methylation and histone modifications, and sheds new 
light on the fine tuning ofthe establishment of DNA methylation. The 
structures may also provide a basis for the design of specific regulators 
for potential therapeutic applications. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version ofthe paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Protein expression and purification. The full-length DNMT3A2 (residues 224- 
912 of DNMT3A) was expressed in sf9 cells using the Bac-to-Bac system (Invitrogen). 
The infected cells were harvested and lysed in 50 mM Tris-Cl pH 8.0, 500 mM NaCl, 
and 0.01% 2-mercaptoethanol. The clarified lysate was applied onto GST affinity 
columns (GE Healthcare) and the fusion protein was cleaved with PreScission pro- 
tease. The protein was stored in 20 mM Tris-Cl pH 8.0, 300 mM NaCl, and 0.01% 
2-mercaptoethanol for the assays. 

Truncations of DNMT3A2 were cloned into modified pGEX-6p-1 vector and 
the CPNMTSL (residues 178-379) was inserted into modified pRSFDuet-1 vector. 
DNMT3A proteins were expressed independently or with CP" MP?! in Escherichia 
coli strain BL21(DE3). The transformants were grown at 37 ^C to an attenuance 
(Deoo nm) of 0.6 in 2X YT medium. The cultures were induced by adding 1 mM 
isopropyl-B-p-thiogalactopyranoside and further incubated for 16h at 15 ^C. The 
supernatant of cell lysate was applied onto GST affinity columns (GE Healthcare) 
and the fusion protein was digested with PreScission protease. The eluted protein 
was purified by ion exchange and gel filtration chromatography. The purified pro- 
teins were subjected to SDS-PAGE and stained by Coomassie blue. The peak fractions 
were concentrated to 5-10 mg ml ` and used for crystallization and biochemical 
assays. Mutations of DNMT3A were purified in a similar procedure. 
Crystallization and structure determination. For crystallization of the autoin- 
hibitory form of DNMT3A, the complex of ADD-CD of DNMT3A (residues 455- 
912) and CPNMTS! was mixed with a palindromic 18-base-pair DNA duplex (5'-G 
AGGCTAGCGCTAGCCTC-3’) and AdoHcy ina 1:1.2:2 molar ratio. The crystals 
were obtained using the hanging-drop, vapour-diffusion method by mixing 1 ul 
ADD-CD-C?X™™" complex with 1 pl reservoir solution containing 0.05 M Bis- 
Tris pH 5.6-6.0, 0.1 M sodium malonate and 8% PEG3350 at 4 ^C. Crystals were 
cryoprotected by the reservoir buffer containing 22% glycerol and then flash frozen 
in liquid nitrogen. Although the DNA duplex was added for the crystallization, no 
corresponding electron density was observed. The DNA duplex may function as an 
additive to favour the crystallization. 

For the crystallization of the active form of DNMT3A, the complex of ADD-CD 
of DNMT3A (residues 476-912) and CPNM™! was pre-incubated with histone H3 
peptide (residues 1-12, ARTKQTARKSTG) at a 1:10 molar ratio before crystalli- 
zation. Crystals were grown by the hanging-drop, vapour-diffusion method by 
mixing 1 ul protein (10 mg ml) with 1 ul reservoir solution containing 100 mM 
sodium acetate (pH 5.3-5.6) and 600 mM ammonium sulphate at 18 °C. Crystals 
were cryoprotected by the reservoir buffer with 25% ethylene glycol and then flash 
frozen in liquid nitrogen. 

The data were collected on beamline BL17U at Shanghai Synchrotron Radiation 
Facility, China, at wavelengths of 1.2816 A and 0.9792 A, respectively. Data were 
indexed, integrated, and scaled using the program HKL2000 (ref. 26). The orien- 
tation and position of CD-C°\™"*" in the ADD-CD-CPNMP*. complex was first 
determined by molecular replacement using CD-C°\™™*" (PDB accession num- 
ber 2QRV)” as a searching model in the PHASER program". The resulting CD- 
CPNMTSL model was refined with PHENIX package”. The ADD domain then was 
put into the refined extra-difference density using MOLREP with the "search for 
model in the map’ module”. The overall structure ofthe ADD-CD-CPNMPE com- 
plex was finally refined with stereochemistry and the reference structure ADD domain 
(PDB accession number 3A1B)'5 as restraints. The anomalous Fourier map of zinc 
cations in the ADD domain confirmed the validity of the position of the ADD domain. 
The structure of the ADD-CD-C?N™™*".3 complex was determined by molecu- 
lar replacement using the ADD domain (PDB accession number 3A1B)'* and CD- 
CPNMT3. (PDB accession number 2QRV)” as searching models in the PHASER 
program", and was then manually built by COOT”. 

All refinements used the module phenix.refine of PHENIX?*. The model quality 
was checked with the PROCHECK program". In the structure of ADD-CD- 
CDNMI3L 85,7% of residues were in most favoured regions, 12.996 in additional 
allowed regions, 0.9% in generously allowed regions, and 0.596 in disallowed regions. 
In the structure of ADD-CD-CP^MDL r5, 88.4% of residues were in most favoured 
regions, 1196 in additional allowed regions, and 0.696 in generously allowed regions. 
All structure figures were generated by 77. 

In vitro DNA methylation assay on naked DNA. The enzymatic activity of 
DNMT3A proteins was assessed by incorporation of a *H-labelled methyl group 
from S-adenosyl-1- [methyl-?H]methionine ([methyl-?H]AdoMet, PerkinElmer). 
A biotin-labelled DNA fragment amplified from the EBNA1 region of p220.2 
(1.2 kilobases, 52 CG sites) was used as a substrate. For histone H3 stimulation, 
0.3 uM DNMT3A proteins were pre-incubated with or without 3 uM histone 3 
peptides (residues 1-12). For ADD-mediated inhibition, CD proteins (1 uM) were 
supplemented with or without the ADD domain proteins (18 LM) on ice for 30 min. 
DNA (100 ng) was methylated by DNMT3A proteins in the presence of 2.5 uM 
[methyl-7H] AdoMet, 25 mM Tris-HCl (pH 7.5), 5% glycerol, 0.01% 2-mercaptoethanol, 
and 0.5 mg ml | BSA. The reactions were incubated at 37 °C for 30 min, and terminated 
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by adding cold wash buffer (500 mM NaCl and 1 mM EDTA in PBST). The DNA 
products were immobilized on streptavidin beads, washed five times, and subjected to 
liquid-scintillation counting (PerkinElmer). Each reaction was performed in triplicate. 
In vitro DNA methylation assay on recombinant poly-nucleosomes. Recombi- 
nant Xenopus laevis histones were expressed and purified as described". Site-specific 
methylation of H3Kc4me3 was performed by the methyl-lysine analogs approach”. 
Incorporation of H3K-4me3 was verified by specific antibodies (anti-H3K4me3: 
Cell Signaling, 9751 s; anti-H3: Abcam, ab1791) and visualized on Tanon-5200 Chemi- 
luminescent Imaging System (Tanon Science & Technology). Assembly of histone 
octamers and reconstitution of poly-nucleosomes were performed by salt dialysis 
using the 601 sequence (30 CG sites)**. The reaction mixture was the same as 
described above except that 300 ng nucleosomal DNA was methylated using 1 uM 
DNMT3A proteins. The reaction was quenched by the addition of excess TE 
buffer and 196 SDS. The histone proteins were removed by phenol-chloroform 
extraction. Then DNA was purified by ethanol precipitation, resolved in TE, and 
subjected to liquid-scintillation counting (PerkinElmer). Each reaction was per- 
formed in triplicate. 

Electrophoretic mobility-shift assay. A 6-carboxy-fluorescein (FAM)-labelled 
double-stranded DNA (dsDNA) containing one CpG site was generated from 
annealing two primers (upper primer, FAM-5'-CTGAATACTACTTGCGCICT 
CTAACCTGAT-3’; lower primer, 5'-GACTTATGATGAACGCGAGAGATTG 
GACTA-3’). The FAM-labelled DNA was used both in electrophoretic mobility- 
shift assays and fluorescence polarization assays. DNA (25 nM) and the indicated 
amounts of proteins were incubated in reaction buffer containing 20 mM HEPES 
pH 7.5, 100 mM KCI, 8% glycerol, and 0.5 mg ml ! BSA for 30 min at 25 °C. The 
samples were subject to a 1296 PAGE and analysed by Typhoon FLA 9500 (GE 
Healthcare) image scanning. 

Fluorescence polarization assay. FAM-labelled dsDNA (15 nM) was incubated 
with increasing amounts of DNMT3A proteins for 30 min at 25 ^C in reaction buffer 
containing 20 mM HEPES pH 7.5, 100 mM KCI, 8% glycerol, and 0.5 mg ml ! BSA. 
Fluorescence polarization measurements were performed on a Synergy 4 Microplate 
Reader (BioTek) at 25 ^C. The bound fractions were calculated as (mP — baseline 
mP)/(maximum mP — baseline mP), in which mP (milli-polarization units) repre- 
sents the fluorescence polarization value. For the peptide stimulation experiment, 
15nM DNA was pre-incubated with 0.2 uM. CD-CP™P or 1 uM ADD-CD- 
CPNMTSL protein, An increasing amount of histone H3 peptide (H3K4me0 or 
H3K4me3) was then added into the protein-DNA complex. The levels of protein- 
DNA complex formation were measured by fluorescence polarization. Each reac- 
tion was performed in triplicate. The curves were fitted using GraphPad Prism 5. 
Isothermal titration calorimetry. To obtain the binding affinity between ADD- 
CD-CPXM™" and H3 peptide, 0.04 mM ADD-CD-C?N™!" (in cell) in the absence 
or presence of 0.5 mM dsDNA (30 base pairs, upper strand: 5'-CTGAATACTAC 
TTIGCGCTCTCTAACCTGAT-3’) was titrated with 0.5 mM histone H3 peptide 
(residues 1-12) (in syringe) using an iTC200 microcalorimeter (GE Healthcare) at 
18°C. Protein, DNA, and peptide were prepared in a buffer containing 10 mM 
HEPES, pH 8.0, 100 mM NaCl, and 0.5 mM TCEP. To obtain the binding affinity 
between the ADD domain and H3 peptide, 0.05 mM ADD protein (in cell) was 
titrated with 0.5 mM histone H3 peptide (residues 1-12) (in syringe). The data 
were fitted by Origin 7.0 software. 

In vitro binding assay. For the GST pull-down assay, 30 ug DNMT3A-CD pro- 
teins were incubated with 8 ug GST-ADD-linker fusion proteins for 1h at 4°C 
in binding buffer containing 20mM Tris-HCl pH 8.0, 100mM NaCl, 0.01% 
2-mercaptoethanol, 5% glycerol, and 0.1% Triton X-100. GST-ADD-linker pro- 
teins were then immobilized to 25 ul of glutathione resins (GE Healthcare) for 1 h 
at 4°C. After washing three times with binding buffer, bound proteins were sub- 
jected to SDS-PAGE and stained by Coomassie blue. 

For the histone peptide pull-down assay, 1 ug biotinylated histone H3 peptide 
(residues 1-21) was incubated with 30 ug wild-type and mutant ADD-CD-CPNMT3L 
proteins for 1 h at 4 °C in binding buffer containing 20 mM Tris-HCl pH 8.0, 250 mM 
NaCl, 0.01% 2-mercaptoethanol, 5% glycerol, and 0.1% Triton X-100. Then 20 ul 
streptavidin beads were added into the mixture and incubated 1h at 4°C. After 
washing three times with binding buffer, bound proteins were subjected to SDS- 
PAGE and stained by Coomassie blue. 

Expression of '?F-labelled proteins. An orthogonal tRNA/tRNA synthetase sys- 
tem was used to incorporate '?F-labelled unnatural amino acid. Briefly, a TAG stop 
codon was introduced into the desired site to encode L-4-trifluoromethylphenylalanine 
(tfmF). A modified pEVOL-tfmFRS plasmid to express tRNAcua and tfmF-specific 
aminoacyl-tRNA was co-transformed with the TAG-carrying plasmid into 1 
(DE3)*°. The bacterial culture was induced with 0.0296 L-arabinose, 1 mM tfmF, and 
1 mM IPTG. The '’F-labelled proteins were purified as for wild-type DNMT3A 
proteins. 

1F NMR spectra measurements. All one-dimensional PF NMR spectra measure- 
ments were performed at 293 K on an Agilent 500 MHz spectrometer equipped 
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with an HFT probe, and the observation channel was tuned to 1F (470.2 MHz), 
with 1,024 free induction decay accumulations in every 4 s recycling delay. A one- 
dimensional °F spectrum was acquired with one pulse program with a 90° pulse 
width of 12.45 us and power at 57 W. The spectrum width was 60 p.p.m. and offset 
was —62 p.p.m. °F chemical shifts were referenced to an external standard, tfmF 
(—62.38 p.p.m.). The data were processed and plotted with an exponential window 
function (line broadening = 20 Hz) using ACD/NMR Processor Academic Edition 
software (ACD/Labs). The spectra of 0.14 mM ADD-CD F686tfmF or 0.12 mM 
ADD-CD F827tfmF with or without histone H3 peptides (residues 1-12) were 
collected at 293 K. 
Electron microscopy data collection. DNMT3A2- in the absence or 
presence of histone H3 peptide was analysed by negative-stain electron microscopy 
in the same manner. The samples were prepared by dilution of purified protein com- 
plex to 6.15 ug ml, then 4-5 ul of this sample was deposited onto a glow-discharged 
400 mesh continuous carbon grid (Beijing Zhongjingkeyi Technology). The sam- 
ple was then stained with 2% uranyl formate and air-dried. Data were recorded on 
a Tecnai G2 F20 TWIN transmission electron microscope (FEI) equipped with a 
field-emission gun operated at 200 kV. Images were recorded at X71,000 micro- 
scope magnification on a 4k X 4k Eagle CCD (charge-coupled device) camera with 
a pixel size of 1.15 A per pixel. The defocus ranged from —0.5 to —0.8 pm. 
Electron microscopy image processing and three-dimensional reconstruction. 
For this, 9,864 and 21,910 particles were boxed out for DNMT3A2-CPNMI3. and 
DNMT3A2-CP™PPL_H3, respectively, by using the e2boxer.py program in EMAN2.1 
(ref. 37). Contrast transfer function parameters were determined for particles boxed 
out from each CCD image using EMAN1.9 procedure ctfit, followed by phase 
flipping using the applyctf program. The data were then low-pass filtered to 10 A to 
enhance the image contrast for three-dimensional reconstruction**”’. Reference- 
free two-dimensional analysis used the EMANI.9 program refine2d.py and IMAGIC”, 
and those class-averages were used to generate initial models by e2initialmodel.py 
in EMAN2.1. Three-dimensional reconstruction was performed by the EMAN1.9 
program refine*^?, Initially, no symmetry was imposed in the reconstruction 
process, and the resulting three-dimensional reconstruction revealed the existence 
of a two-fold symmetry in both maps but in different locations, which was subse- 
quently imposed in the reconstruction process. The final resolution was estimated 
at 24 A and 20 A, respectively, by the 0.5 FSC criteria using the eotest program in 
EMAN1.9. 

UCSF Chimera (http://www.cgl.ucsf.edu/chimera/)* was used to render the elec- 
tron microscopy density together with the crystal structures. In addition, the Fit In 


DNMT3L 
C 


Map module in Chimera was used for rigid-body fitting ofthe crystal structure into 
the corresponding electron microscopy density map. 
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Extended Data Figure 1 | In vitro DNA methyltransferase activity of 
DNMT3A. a, In vitro DNA methyltransferase activities of DNMT3A2 
(purified from insect cells) in the absence or presence of CP"MDL The assays 
were performed in the presence or absence of histone H3 peptide (residues 
1-12). Note that CPN?" could enhance the activity of DNMT3A2 by a factor 
of 2-3, which is consistent with previous study”. However, histone H3- 
mediated activation of DNMT3A is independent of the existence of CONM™", 
b, In vitro DNA methyltransferase activities of DNMT3A2 (purified from 
insect cells) or DNMT3A2-CPNMT3L (purified from bacteria) in the presence or 
absence of histone H3 peptides. c, Enzymatic activities of DNMT3A2 (purified 
from insect cells) or DNMT3A2-CP*MP?L (purified from bacteria) using 
reconstituted nucleosomes as substrates. Nucleosomes containing unmodified 


histone H3 or H3K-4me3 were subject to SDS-PAGE and visualized using 
specific antibodies. d, e, Enzymatic activities of various N-terminal deletions of 
DNMT3A2 in the absence (d) or presence (e) of ۷ Corresponding 
relative activities are indicated at the bottom of each figure. CPM, counts per 
minute. Error bars, s.d. for triplicate experiments. The ADD-CD or CD protein 
purified from bacteria was not stable in solution and tended to precipitate out, 
which may have resulted in their lower activities under our experimental 
conditions (compared with DNMT3A2 purified from insect cells). Because 
CPNMTS could stabilize DNMT3A and had no effect on histone H3-mediated 
activation, protein complexes ADD-CD-CP™PY and CD-CPNM™" were 
used in the following studies if not specified. 
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Extended Data Figure 2 | Crystal structure of ADD-CD-CP™PP in 
autoinhibitory form. a, b, Two different views of the 2F observed — Fealculated 
maps for CPNMT3L (a) and CD (b) domains in the ADD-CD-C?NM™" 
structure. The maps were calculated at 3.82 A and contoured at 1.50. Only 
main-chains are shown for simplicity. c, The 2F observed — Fealculatea maps for the 
ADD domain after refinement of the CD-CP"MP. complex (top) and after 
refinement of the ADD-CD-CP*MT*- complex (bottom). The maps were 
calculated at 3.82 A and contoured at 0.8. Main-chains from most residues, 
including residues 526-533 involved in the interaction with CD domain, fit 
well into the electron density. Some loop regions were not well covered by 
electron density, which is consistent with a high B factor (Extended Data 

Fig. 8a) of the ADD domain in the complex structure, supporting the dynamic 
feature of the ADD domain for regulating enzymatic activity of DNMT3A. 


d, Zn-anomalous difference map contoured at 3.56 shows the positions of zinc 
cations in the ADD domain. e, Gel filtration profiles for standard proteins and 
the ADD-CD-CPNMT3L complex The peak position corresponds to the dimer 
of ADD-CD-CPM™ with a molecular mass of about 140 kDa. f, Dimer 
formation of the ADD-CD-CP"MT?- complex in crystals. The dimer of 
ADD-CD-C°\™™*" complexes is mediated by CD-CD interaction in a 
two-fold crystallographic symmetry. Given the difficulty in tracing the 
conformation of the side chain in 3.82 A resolution structure, we have not 
discussed the specific hydrogen bond or hydrophobic interaction within 
ADD-CD-CP^MTt. Residues 832-846 of DNMT3A were not built in the 
model because they lacked electron density, which may have resulted from their 
flexibility in crystals. 
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Extended Data Figure 3 | Structure of ADD-CD-CPNMP3. in 

autoinhibitory form. a, Superimposition ofhuman ADD-CD-C 
mouse CD?™82_CP2™3L (lack of ADD domain, PDB accession number 
2QRV)? structures shown in ribbon representations. CD- CP" MP j 


DNMT3L 


with 


in two 
structures is well aligned with a root mean squared deviation of 1.28 À for 723 
Ca aligned. The function of DNMT3A-DNMT3L complex dimerization has 
been characterized in a previous study”. The functions and structures of the 
CD and CPNM™*" domains, and the CD-CD and CD-C?N™™" interfaces, were 
not discussed in this work. b, Overall structure of ADD-CD-CPNMDPE with 
CD-CPNMT! shown in electrostatic potential surface, and the ADD domain 
and linker shown in ribbon representation. The linker packs against a 
hydrophobic surface of the CD domain. c, d, Close-up view of linker-CD 

(c) and ADD-CD (d) interfaces with the electrostatic potential surface of the 


ADD. دوم‎ Dat 


IN DNMT3AADD 
| DNMT3ACD 


WÎ DNMT2A linker 


0011138 _ Dnmt3l 
CD -C 


Dnmt3a-Dnmt3l 


DNMT3A 
Autoinhibitory form 


mm 
DNMT3L-H3 complex 


LETTER 


DNMT3AADD 
DNMT3A CD 
DNMT3A linker 


DNMT3L ADD 
DNMT3L C-Like 


CD domain indicated. Critical residues are shown in stick representation. 

e, Superimposition of ADD-CD-CP*MP and DNMT3L-H3 structures 
(PDB accession number 2PVC)? shown in ribbon representations in two 
different views. The CD domain and C-like domain of DNMT3L were aligned 
for comparison. Note that the extendedloop ofthe ADD domain in ADD-CD- 
CPNMT. overlaps with an a helix in the DNMT3L-H3 structure. DNMT3L 
is unlikely to adopt a similar conformation to that of ADD-CD-CP5MT3L 
because otherwise the ADD domain will have steric hindrance with the C-like 
domain of DNMT3L (dashed circle). According to the above analyses, the 
structure ofthe autoinhibitory form of DNMT3A could not be predicted on the 
basis ofthe DNMT3L structure because the overall structures of DNMT3A and 
DNMT3L are different. 
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HhaI 144 DYSFHAKVLNALDYGI POKRERI YMICFRNDLNIQNFQFPKPFELNT FVKDLLI§PD SEVEHLVIDRKD LVMTNQEMERTT PKTVRLGIVGRGGOGERIYST 244 


hDNMT3A 860 912 
mDnmt3a 856 908 
zDntm3a 677 731 
hDNMT3B 801 853 
hDNMT3L 339 386 
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x Residues involved in ADD-CD interaction in active form 
v Residues involved in ADD-CD interaction in autoinhibitory form 
= Residues involved in histone H3 interaction 


Extended Data Figure 4 | Sequence alignment of DNMT3 family members. structural elements are coloured as in Fig. 1a and indicated above the sequences. 


Sequences of human DNMT3A (NP_072046), DNMT3B (NP_008823), Invisible residues in the structure of ADD-CD-C?N™*" are indicated as 
DNMT3L (NP_787063), mouse Dnmt3a (NP_031898), zebrafish Dnmt3a dashed lines above the sequences. Residues involved in ADD-CD interactions 
(NP_001018150), and DNA methyltransferase from Haemophilus in active form or autoinhibitory form are indicated as black stars and red 
parahaemolyticus (WP_005706946) used in the alignment. Highly conserved ` triangles, respectively. Residues involved in H3-ADD interactions are 

and identical residues are highlighted with dark green background, and indicated as blue squares. 


conserved residues are indicated with light green background. Secondary 


©2015 Macmillan Publishers Limited. All rights reserved 


a 


LETTER 


Ke AV Ke NÉI 
E SU شي‎ ose oF 
X ام‎ A X کي لم‎ oF Af d x «O 
Po «E SP ES ae d ar a Pas P A 
ص لي کي لي ې شي بي کې‎ q d do کہ کہ‎ SP d d 
wm ` emm کي‎ — ee emm = mmm سي — ېټ پس بي سه‎ «- GST-ADD 
-Linker 
—— = سا‎ —- -—-— o 0 «CD 
mmm — س س سے کے = — سس‎ zm - «GST 
Control Pull-Down 
b e X wv wv 
y d d JS EU 
«S Y c Y S © 9 9 CNN 
SA di oD oD A SLE LE LEE” 
LP CPL CPL c zs = : 
ea wu» & wm = + GST-ADD -> p » GST-ADD 
-Linker رر سی نه‎ ye -Linker 
æ= — - Ger س س‎ «GST 
Pull-Down Control 2% Input Pull-Down Control 
S & 
e e S 
d کي کي‎ 9 f EB No peptide 
S GF ok SA 
«S XN NU +H3K4me0 
کے‎ go ^ Sy 1000 
SPP PA EE? 
فاق مد‎ a «- GST-ADD 
= -Linker 
س —— دو سا‎ «CD 
—_ -— «-GST 
Pull-Down Control 
e a? e$ 
N N 
ŞS SPS WT 621-632 to GS linker 
& AAT کې‎ SAS SE 
d PORE l ADD-CD.C Pra 
سه‎ Sal): Seng 
A eo Saal c l 
- WW. Ger 
Control Pull-Down 


Extended Data Figure 5 | Interactions between the ADD and CD domains. 
a, b, GST pull-down assays with recombinant CD (residues 627-912) 

protein incubated with wild-type or mutant GST-ADD-linker proteins 
immobilized on glutathione resin. The bound proteins were analysed by 
SDS-PAGE and Coomassie blue staining. c, GST pull-down assays using 
wild-type or mutant ofthe CD domain. d, GST pull-down assays in the absence 


or presence of histone H3 peptide (H3K4me0 or H3K4me3). e, GST pull-down 
assays with the CD domain incubated with GST-ADD or GST-ADD-linker 
proteins immobilized on glutathione resin. f, Activities of wild-type and mutant 
ADD-CD. Residues 621-632 were replaced by a GS linker in the 

mutant proteins. 
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Extended Data Figure 6 | Interactions between DNMT3A and DNA. 

a, Enzymatic activities of wild-type and mutant ADD-CD-C°\™"*", Residues 
on the missing loop (residues 831-846) were mutated for the in vitro DNA 
methyltransferase activity assay. Error bars, s.d. for triplicate experiments. 
Mutating above residues leads to loss of activity of ADD-CD-CPNMDL 
supporting their important role in catalysis or DNA recognition. The missing 
loop in the ADD-CD-CP'MT*t structure is equivalent to a DNA-binding 
loop in the HhaI-DNA structure. b, DNA has no effect on the interaction 
between histone H3 and DNMT3A. Left, isothermal titration calorimetry 
enthalpy plot for the binding of isolated ADD domain (in cell) to histone H3 
peptide (residues 1-12, in syringe), with the estimated binding affinities (Ka) 
listed. Right, superimposed isothermal titration calorimetry enthalpy plots for 
the binding of ADD-CD-CP™PP? (in cell) to histone H3 peptide (residues 
1-12, in syringe) in the absence or presence of dsDNA. The estimated 


binding affinities (K4) are listed. Histone H3 peptide has comparable binding 
affinity to the ADD domain alone (1.75 pM) and ADD-CD-CP"MP in 
autoinhibitory form (2.14 uM), and the addition of DNA was not able to 
enhance the binding affinity further. The presence of DNA led to a slight 
decrease in the binding affinity between histone H3 peptide and ADD-CD- 
CPNMTSL. which may have resulted from slight precipitation of the protein 
caused by the high concentration of DNA used for titration. c, Electrophoretic 
mobility-shift assays for DNMT3A proteins in the absence or presence of 
histone H3 peptide, with protein concentrations indicated. H3-ADD-CD 
represents a fusion protein with histone H3 (residues 1-20) at the N terminus of 
ADD-CD. The assays showed that CD-CP"MT?. strongly bound to the FAM- 
labelled DNA duplex, whereas the existence of the ADD domain markedly 
decreased DNA-binding affinity, which was partly restored by the addition of 
histone H3 peptide or largely restored by H3-ADD-CD fusion protein. 
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Extended Data Figure 7 | Structure of ADD-CD- CP"MDP'. H5 in active 
form. a, Ribbon representations of the overall structures of the ADD-CD- 
CPNMTSL in active (left) and autoinhibitory (right) forms. Histone H3 peptides 
are coloured in yellow. b, Structural comparison of human ADD-CD- 
CPNMTSL_H3 and mouse CD?"™3*_C?™™" complexes. The compared 
structures are shown in ribbon representations. The ADD-CD-C?N“™*" 
complex structure (this study) is coloured as in Fig. 1d, and the C 
CPnm8l complex structure” is coloured in grey. Residues 611-620 and 
833-846 of DNMT3A were not built in the model because they lacked electron 
density. c, LIGPLOT representation of the ADD-CD interactions in the 
ADD-CD-C?\™™**.H3 structure. Carbon, oxygen, and nitrogen are shown 
as black, red, and blue balls, respectively. Hydrogen bonds are indicated as 
dashed lines, with lengths given in A. d, Close-up view of the ADD-CD 
interface. Critical residues for the interactions are shown in stick 
representation, and hydrogen bonds are indicated as dashed lines. The C 
terminus (residues 903-911) of the CD domain and a loop region (residues 
621-632) together form a flat patch for interaction with the ADD domain. 
Hydrogen bonds are formed between residues N551, N553, and R556 of the 


DPamtsa_ 


ADD domain and residues E629, C911, and E907 of the CD domain. Residues 
Y526, Y528, W601, and F609 of the ADD domain, V622 and P625 of the linker, 
and R803 and P904 of the CD domain are involved in hydrophobic 
interactions. e, Structural comparison of ADD-CD-H3 in ADD-CD- 
CPNMT3L_H3 (this study) and DNMT3L-H3 structures (PDB accession 
number 2PVC)". Two compared structures are shown in ribbon 
representations with ADD domains (left) or catalytic domains (right) 
aligned, respectively. The DNMT3L-H3 structure is coloured in grey. When 
the ADD domains are superimposed, the catalytic domain moves with a longest 
distance of 19 À. When the CD domains are superimposed, the ADD 
domain moves 6 A. f, Close-up view ofthe H3-ADD interface. Critical residues 
for the interactions are shown in stick representation, and hydrogen bonds 
are indicated as dashed lines. The fashion of histone H3- ADD interaction is 
similar to that observed in the structure of the H3-ADD fusion protein’. 

g, Histone H3 peptide pull-down assay. Recombinant wild-type and mutant 
ADD-CD-C?™ proteins were incubated with biotinylated histone 3 
peptide (residues 1-21) and immobilized onto streptavidin sepharose beads. 
Bound proteins were subjected to SDS-PAGE and stained by Coomassie blue. 
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Extended Data Figure 8 | Conformational change of DNMT3A induced by 
histone H3 tail. a, Average B factors for domains of ADD-CD-CPMMT3L in the 
structures of ADD-CD-CP"MP* and ADD-CD-C?N™™" bound to H3 
peptide. The average B factor of the ADD domain is higher than other domains 
in both structures, and is higher in autoinhibitory form (177.5 A”) than that in 
active form (107.6 A”). The results indicate that the ADD domain is more 
dynamic than other domains of the complex, especially in its autoinhibitory 
form. The observation further supports the idea that DNMT3A undergoes 
conformational changes on the ADD domain induced by histone H3. b, Two 
different views of the electron microscop density maps of DNMT3A2- 
CPNMTSL (left) and DNMT3A2-CPNMPE H3 (right) processed to 24 À and 
20 À resolution, respectively. The corresponding crystal structure was fitted 
into the electron microscopy density map for each state. The density is not fully 
occupied, which might because of the missing PWWP domain in the crystal 
structures. c, Typical negative stain CCD images of DNMT3A2-CPNMT?t (left) 
and DNMT3A2-CP^MP. Hä (right). Representative particles are highlighted 


by white boxes. d, Comparison of the two-dimensional projections (bottom) 
from the electron microscopy map with the corresponding reference-free 
two-dimensional class averages (top) reveals similar structural features. 

e, Position of residues F827 and F868 for ^F NMR measurements. Close-up 
view of the DNMT3A structure in autoinhibitory form with residues F827 and 
F868 indicated in stick representation. Residue F827 is located in loop L2 
(for DNA binding) and close to the ADD domain. As a negative control, residue 
F868 is located close to the catalytic cavity and away from the ADD domain. 
Residue F868 is unlikely to undergo conformational change when the ADD 
domain dissociates from the CD domain. To detect conformational changes 
of DNMT3A in solution, residues F827 and F868 were substituted by 
'?F-Jabelled 1-4-trifluoromethylphenylalanine ('°F-tfmF) in ADD-CD. 

f, One-dimensional "" NMR measurements were performed using ADD-CD 
with substitution of F827tfmF (left) or F868tfmF (right) in the absence or 
presence of H3K4me0 or H3K4me3 peptide. The chemical shift for each 
measurement is indicated. 
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Extended Data Table 1 


Data collection 
Space group 
Cell dimensions 

a, b, c (A) 

a, BY © 
Resolution (Å) 
Bem or Rinerge 
I/ol 
Completeness (%) 
Redundancy 


Refinement 
Resolution (A) 
No. reflections 
Reverie Ree 
No. atoms 
Protein 
H3 peptide 
Ligand/ion 
Water 
B-factors (A’) 
Protein 
H3 peptide 
Ligand/ion 
Water 
R.m.s deviations 
Bond lengths (A) 
Bond angles (°) 


* Highest resolution shell is shown in parentheses. 


Data collection and refinement 
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complex 
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20564 

0.230 / 0.273 

5043 


29 


ADD-CD-CP"MBL 
complex with H3 peptide 


P 6; 


183.8, 183.8, 123.3 
90, 90, 120 

50.0 — 2.90 (3.00 — 2.90) 
0.095 (0.721) 

20.5 (3.5) 

99.0 (100.0) 

9.9 (9.8) 


50.0 — 2.90 (3.00 — 2.90) 
52407 
0.223/ 0.261 
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Extended Data Table 2 | Effect of DNMT3A mutants on autoinhibition 


H3K4me0 recognition (residue K4) Release 
ADD-CD interface (autoinhibitory form) 


i ect 
D531A H3K4me0 recognition (residue K4) Release 
ADD-CD interface (autoinhibitory form) 
. oi: N . 


Residues 809-813 replaced by GGSGG [Potentially for DNA recognition 


Residues 821-846 replaced by | Potentially for DNA recognition Decrease 
GGSGGSGG 


All mutants were made on ADD-CD (residues 476-912) and the protein complexes ADD-CD-CP™MT3L were used for the assays. ‘Release’ represents release of the autoinhibition by at least twofold activity 
enhancement. ‘Inhibition’ represents inhibition of activity and no response to H3 peptides. ‘No-change’ indicates that the mutants behaved similar to wild-type protein. ‘Decrease’ represents decrease or loss of 
enzymatic activity. 
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